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Abstract 



A nonlinear model with response variable missing at random is studied. In order to 
[jj , improve the coverage accuracy, the empirical likelihood ratio (EL) method is con- 

sidered. The asymptotic distribution of EL statistic and also of its approximation is 
X^ if the parameters are estimated using least squares(LS) or least absolute devia- 
"j^ , tion(LAD) method on complete data. When the response are reconstituted using a 

"^ I semiparametric method, the empirical log-likelihood associated on imputed data is 

also asymptotically x^- The Wilk's theorem for EL for parameter on response vari- 
able is also satisfied. It is shown via Monte Carlo simulations that the EL methods 
^ I outperform the normal approximation based method in terms of coverage probabil- 

vQ ' ity up to and including on the reconstituted data. The advantages of the proposed 

method are exemplified on the real data. 
^^ I keywords: random nonlinear model; response missing at random; empirical likeli- 

•/^ ' hood; semi-parametric estimation; 

o 



1 Introduction 



It is very common in practice, for a model, to measure the regressors(covariates) 
but for varied reasons it is sometimes impossible to have all values of the 
response variable. The most widely used idea is to remove of model the ob- 
servations with missing data. An alternative solution is to consider empirical 
likelihood(EL) method which is a powerful nonparametric method for con- 
structing confidence regions of parameters. 

To our knowledge, previous theoretical and numerical investigations in liter- 
ature have focused for model with na is sing r esponse only in the linear case. 



The EL method, proposed by lOwenI (jl990l ) does not need the asymptotic 
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variance of estimator and it outperforms the norr aal approximati o n me thod 
in term of coverage probabihty for hnear models. IWang and Raol ( 12002| ) de- 
velops EL inferences for the mean of a response variable under regression 
imputatio n of missing resp onses for a linear regression model and random 
covariates. iQin et al.l (|2009[ ) c onstruct an EL statistic on parameter when re- 
gressors are deterministic and IXud (J2009aJ) if regressors are random, based on 
least squares(LS) method for hnear model Y = X*/3+£. ISun and Wangl (|2009[ ) 
consider the general linear model Y = H(X)*/3-|-£ with H(x) a known vector 
function and investigate a hypothesis test on the response variable. These last 
three papers impose the condition that the conditional expectation of error 
e with respect to covariate X is zero. If this hypothesis is not satisfied, in 
order to reconstitute the response variable the least absolute deviations(LAD) 
method can be used. One advantage of least absolute deviations estimation is 
that it does not require any moment condition on the errors to obtain asymp- 
totic normality. 

It is well known also that one outlier may cause a large error in a least squares 
estimator. This occurs in the case of fatter tail distributions of the error term. 



On the other hand, as iBail ( 119981 ) and iKim and Choi! (119951 ) indicate it, for 
heavy tailed distributions the LAD estimator is more efficient than LS esti- 
mator. 

Concerning the L AD estima t or in a complete nonlinear model we can r efer to 
following papers: lOberhoferl (119821 ) shows conditions for its consistency, [Weiss 
( 1l991l ) proves that this estimator is consistent and asymptotically normal in a 
dynam ic nonline a r mod el with neither independent nor identically distributed 
errors. ICiupercal ( 20101 ) gives the convergence rate w„, where (f„) is a mono- 
tone positive sequence such that f„ — )■ and nv^ — )• oo for n — )■ oo. It is 
well known that confidence regions based on the asymptotic normality could 
encounter large coverage errors in small sizes or if the error distribution has 
outliers. The LAD technique was already used in censored median linear re- 



gression model with missing data: see e.g. IZhao and Chen I (120081 ). 



For other relevant pap ers (^not exhaust i ve list ) on EL met h od for missing data 



i n a l in ear models see IWang and SunI (120071) , ISun et al.l (120091 ) , IWang et al. 
(l2004h . lYa^g et all (l2009f ). Ihiang et aP (l2009f ). 

In this paper, for a nonlinear random model Y = f(K;/3) + e with missing 
responses, some empirical likelihood ratios are constructed by using complete- 
case or imputed values. The nonparametric version of Wilks' theorem is proved 
for two cases: the parameters are estimated by least squares and least absolute 
deviations on complete data. The limiting distribution of EL statistic is x^? re- 
sults which can be used to construct confidence region on parameter. In order 
to complete data, the v alue i r nputed of missing value of variable response is 
obtained by generalising IXud (l2009al ) idea for linear model using a semipara- 
metric technique: the parameters regression are estimated by LS method and 
missing probability by a nonparametric method. We show that the empirical 
log-likelihood ratio on parameter based on the improved data is asymptotically 
chi-squared. The numerical simulations proves that EL methods outperforms 



the normal approximation method in terms of coverage probabihty up to and 
including on the reconstituted data. If the distribution of the errors presents 
outliers, the LAD method gives generally best results that LS method on 
coverage probability and on parameter estimators efficiency. In addition to 
that, if the expectation of error does not exist, as it is the case for Cauchy 
distribution, the normal approximation of LS estimator can not be satisfied. 
On the real data we obtain also that our semiparametric method gives more 
precise results to reconstitute the response variable that classic parametric LS 
method. 

The rest of this paper is organised as follows. In Section 2 we introduce model, 
assumptions and some notations. In Section 3 the Wilks' theorem for EL statis- 
tic and also for its approximation is given, when the parameters are estimated 
using LS or LAD method on complete data. The LS case is developed in 
Section 4, a reconstituted value for the response variable is introduced and 
asymptotic distribution of EL for response variable is obtained. Section 5 il- 
lustrates by simulation results that EL methods for nonlinear random model 
outperform the normal method and give very competitive coverage probabili- 
ties. An application to the real data is presented in Section 6. Finally, Section 
7 contains the proofs of the lemmas and of the theorems. 



2 Notations and model 

Let us consider following random nonlinear model: 

F, = /(X„/3) + £„ 2 = 1, ■■■,«, (1) 

where (£:j,Xj) is a sequence of continuous independent random vectors with 
the same joint distribution as {e, X) and f3 is a. dxl vector of unknown regres- 
sion parameters. More precisely, e is a random variable and X a p x 1 random 
vector of covariates. Let (3^ denote the true (unknown) of the parameter f3. 
With regard to the random variable e we make one of the suppositions: 
(HI) iE[ei|Xi] = and iEfef jX^] < oo, Vi = 1, ■ ■ ■ ,n. 

(Hlbis) lE[sign{ei)\'X.i] = and Si have continuous density e(t) satisfying 
e(0) > 0. 

These conditions are essential for the consistency and asymptotic normality 
of the LS, respectively LAD estimator. 



In following we use notation f (x, (3) = (9/(x, l3)/df3, f (x, /3) = ^^/(x, /3)/9/3l 

For a vector, let us denote ||.|| the Euclidean norm. 

Regression function / : T x F — )■ R, with T C MP, F C R'^ and random vector 

X satisfy the conditions: 

(H2) for all X G T and for /3 G F, the function /(x, /3) is twice differentiable 



in j3 and continuous on T. 

(H3) iP[||X|| > Mn] = o{n~^/'^) for any positive sequence M„ — >■ oo as n — )■ cxo. 
(H4) ||f(x, /3)||, ||f(x, /3)|| are bounded for any x G T and /3 in a neighbor- 
hood of /3°. 

Sets T and F are compacts. This, assumptions (H2) and (H4) are commonly 
used in nonhnear modehsation and are necessary for the consistency and for 
the asymptotic normahty of the LS or LAD estimator. Assume also that the 
model is identifiable: if /(X; (3) = /(X; /3*) with probability one, then f3 = f3* . 



For model ([T]), all the Xj's are observed, in exchange response variable Yi can 
be missing. Let be the sequence of random variables {Si)i<i<n defined by: Si = 
if Yi is missing and 6i = 1 ii Yi is observed. We suppose that Yi is missing at 
random(MAR): r[6i = l|Xi, Yi\ = IP[5i = l|Xi], for alH = 1, ■ ■ ■ , n. Consider 
the selection probability function: 7r(x) = 1P[6 = l|X = x] >0, Vx G T. 
The supposition 7r(x) > is a common assumption in the literature. The 
parameter f3 is estimated on the completely observed data by two methods: 
least squares(LS) method: 



^^^LS = argminJ^^jF, - /(X„/3)]2 (2) 

/3 i=l 



and least absolute deviations(LAD) method: 



n 



^n,LAD = argmin^^ilFi - /(Xi,/3)|. (3) 

To build the EL statistic, let us consider following functions, for i = 1,- ■ ■ ,n: 

g,,^5(/3) = 54F,-/(X„/3)]f(X„/3), 

gi,LAD{/3) = 6iSign{Y, - /(X„/3))f(X„/3) 
and let be also: 

1 " 

g(/3) = -E&(/3), 
"- i=i 

with gi(/3) either gi,L5(/3) or gi,LAD(/3)- The two estimators are a solution of 
the system of equations: g(/3) = 0. Under (HI), respectively (Hlbis), we have: 
iE[gj(/3)] = for gj = gj,L5, respectively Zi,LAD- The joint distribution of the 
errors Si and of Xj is unknown, but conditional mean, respectively median, of 
Si is zero. 

The empirical likelihood for /3 with complete-case data can be defined as (see 



Owed (|199Q|)): 

Ln{/3) 



{n n n '\ 



(4) 



Without constraint Z]r=iPigi(/3) = 0, the maximum of HiLiPi i^ attained for 
Pi = n~^. Then, the profile empirical likelihood ratio for (3 with complete-case 
has the form: 



Rnif3) = sup <n^P»5 J2Pi = ^^Y.PiSiil^) =^ 

(pir-,Pn)G(0,l)" U=l j=l j=l J 

The empirical log-likelihood ratio statistic evaluated at /3 is: 



(5) 



Uf3) = -2 sup J2 iog(^p.); Ep^ = 1, Ep*g«(/3) = . (6) 

(pi,-,Pn)e(o,i)" U=i 



j=i 



j=i 



Let A G M'^ be, a Lagrange multiplier. Then, Rn{f3) is maximised for pi 
n~^[l + A*gj(/3)]~^, where A satisfies the equation: 



o = -E- 



m 



(7) 



+ A*g.(/3)- 
Thus, ln{/3) can be written: 

n 

/.(/3) = 2 5:iog(l + A*g.(/3)). (8) 

In order to study the asymptotic properties of the EL statistic given by ([8]), 



let us consider following matrix: A = IE 



7r(X)f(X,/30)f(X,/30) 



and under 



£2(X)7r(X)f(X,/3°)f (X,/3°) 



. On suppose for the 



assumption (HI): B = E 

first matrix: 

(HA) A is a positive definite matrix. 

Let us notice that A is Fisher information matrix on complete data. 



3 Asymptotic properties 



We give first a classical result for a MAR model, lemma that turn out to be 
useful in the proof of the main results. 

Lemma 3.1 Under assumptions (HI), respectively (Hlhis) and (H2), (H4), 
(HA), we have: 

(^)j^^Y:^,S.{(3');^^^fMV), 



(ill) maxi<i<„ ||gi(/3°)|| = oip{n^/'^), 

with V = B for LS and V = A for LAD method. 



Let us consider following matrix: 

n 

B„ = n-i^g,(/3°)gK/3°). (9) 



j=i 



Following theorem gives the asymptotic distribution of the empirical log- 
hkelihood statistic ([S]), evaluated at the true value. 

Theorem 3.1 Under assumptions (HI), respectively (Hlbis), (H2), (H4), (HA) 

We can use Theorem 13.11 to get approximate confidence region for (3 or for 
testing the hypothesis: Hq : f3 = /3°. Since, by the proof of Theorem 13.11 (see 
Appendix) for the EL statistic we have: 

/4/3°) = B-' (^ Er=i sim) (^ Er=i g^m) (l + o^(l)), in order to cal- 
culate numerically ln{l3^) we can use the approximation: 

W)=B-' (^tslifB')] f^E&(/3°)l • (10) 



n--{ ) VV^i=i 



We state this as a corollary. 

Corollary 3.1 Under the same assumptions as in Theorem \3.1\ the asymp- 
totic distribution of 1^{I3^) is X^id)- 

Thus, an asymptotic (1 — a) confidence region for /3, based on EL statistic on 
complete data is: {[3]l*^{(3) < Ci^a;d} where Ci^a^d is the (1 — a) quantile of 
the chi-squared distribution with degrees of freedom d. It is very interesting 
to note that to construct the confidence region for /3 it is not necessary to 
calculate the Lagrange multiplier which intervenes in (j8]), observed data are 
enough. 

Asymptotic normality of LS and LAD estimators calculated on complete data 
is given by the following result. 

Theorem 3.2 (i) Under assumption (HI), (H2), (H4), (HA) we have: 

V^0n,Ls - /3°) „-^ ^M A-'BA~'). 

(a) Under assumption (Hlbis), (H2), (H4), (HA) we have: 



These theorem allows to give the normal approximation based confidence re- 
gion, expression which will be specified in Section 5. Then, on complete data, 
we have the choice between four statistics (/^(/3°) for LS, /5^(/3°) for LAD, 
/^n L5; l^n lad) ^^ ^^^^ hypotheses or to build the asymptotic confidence re- 
gion of model parameter (3. We see in Sections 5 and 6, by simulations and 
a model on real data, that approximated EL statistics are sharply superior 
to normal approximation given by Theorem 13.21 If error distribution presents 
outliers then /*(/3) for LAD method is recommended, otherwise it is better 
to consider /*(/3) for LS method. This last one will more be developed in the 
following section. The missing probabilities 7r(Xj) are estimated by a nonpara- 
metric method, this is going to allow to reconstitute the missing responses. 
On the observed and the reconstituted observations one defined a new EL 
statistic, which also satisfies a Wilk's theorem. Besides, numerically, it gives 
very competitive results (see Sections 5 and 6). 



4 Special case of LS estimator 



Following e.g. IXud ( l2009al ). iQin et al.l (|2009[ ) for linear model, we shall intro- 



duce the forecast of Yi, constructed by using LS estimator for parameter (3 
and a nonparametric estimator for probability 7r(Xj): 

Yn, = ^r, + (l- -^) /(X,- ^„,^^), 2 = 1, • • • , n, (11) 

7r(Xi) \ 7r(Xi)y 



with 7r (X,;) a nonlinear estimator for 7r(Xj), as in the linear regression IXue 
fl2009ah : 



7r(X,;l 



max{l,Er=i^/.((X.-x)//i)}^ 



where h = hn is a. positive sequence tending to as n — )• oo and K^ is a kernel 

function defined in M'^. The bandwidth /i„ satisfies: 

(H5) nh?^M~'^'^ -)■ oo and n/i^™'''''^^''^"^^ -)■ 0, as n -)■ oo, with the sequence 

Mn given in assumption (H3). 

The kernel function K, satisfies the classical con dition (imposed also for the 



linear model using the LS method of IXud (J2009aJ)): 

(H6) there exist positive constants Ci, C2 and p such that: C'ilL||,u||<p < 
K{u) < C2l||„||<p. 

Concerning the selection probability function 7r(X) let us make following reg- 
ularity hypothesis necessary in the study of its nonparametric estimator. 
(H7) p(x) has bounded partial derivatives up to order max(2, (i — 1) almost 
everywhere. 

Conditions (H5)-(H7) are usual assumptions for convergent rates of kernel 
estimating method. 



Let us denote ^° = 1E[Y] the mean of Y and cr^(x) = lE[e^\'X. = x] error 

variance. 

Following lemma gives the asymptotic normality for the sequence (l^,j) and 

other two similar results of Lemma 13.11 

Lemma 4.1 Under assumptions (H1)-(H6) and if Var[f(X., f3^)] < oo we 
have: 

(n) ^n^.iVn, -er£^w, 

(Hi) maxi<i<„ \Yn,i\ = opin^/"^), 
with W = Var[f(X, /3°)] + IE 



7r(X) 



Using similar arguments as for Theorem 13.11 we obtain the following result. 
We hence omit its proof. 

Theorem 4.1 Suppose that assumptions (H1)-(H6) hold, then for empirical 
log-likelihood for 0^ : 

{n n n ^ 

i=l i=l i=l } 

we have: Inxi^^) — ^ X^(l)- 



This result can be used to make test of hyp othes i s or to construct asymptotic 
confidence region for the response variable. IXud (J2009bl ) constructs a weight- 
corrected empirical log-likelihood ratio for ^° which is also asymptotically 
chi-squared. 
Let be now following functions constructed using the reconstituted response: 

gn,.(/3) = [r„,,; - /(X,; /3)]f (X,; /3), ^ = 1, • • • , n. 

Consider also the empirical log-likelihood associated at gn,i(/3): 

{n n n "> 

^log(npi); J2Pi = '^^^PiSnAf^) = '^f ■ 
i=l 



2=1 



i=l 



Then, the equivalent of ([8]) is: 

n 

/;(/3) = 2^1og(l + A*g„,(/3)). 
j=i 

Consider following lemma needed for Theorem 14. 2[ 
Lemma 4.2 Under assumptions (H1)-(H7), we have: 



:i2) 



(ui) maxi<i<„ ||g„,i(/3°)|| = ojpf"^/^^ 



7r(X)-i52(X)f(X;/3°)f*(X;/3°) 



n 



Following result shows that the empirical log-likelihood ratio on /3° based on 
the reconstituted data converges to towards x^id)- This theorem shows in a 
similar way as Theorem 13.11 then the proof will be omitted. 

Theorem 4.2 Under assumptions (H1)-(H7) we have: ln{f3^) — > X^{d)- 



From Theorem 14.21 one can construct an asymptotic (1 — a)-level confidence 
region for f3 using all available values for X and the reconstituted values for 
Y. 

Corollary 4.1 In a similar way in the complete data, Corollary \3.1\ the 
statistic lnif3) may be approximated by: 

m') = b:-' i^^tsUf^')] i^tsnAP')] , (13) 



with B*^ = n"^X;r=ign,i(/3°)gn,i(/3°)- The asymptotic distribution of l*Xf3°) 
isx^{d). 

This implies that for testing hypothesis Ho : (3 = (3^ we can use the statistic 
/;(/3°) with asymptotic reject region {(Fi,Xi, (5i)i<i<„; /;(/3°) > ci-a-d} where 
Ci-a;d is the (1 — a) quantile of the chi-squared distribution with degrees of 
freedom d. 

Remark 1 Since the con vergence rate of /3„ ^^^ to f3^ can be slower than 
^/^ (see lCiuperca ((20 lu) ). then Lemma \4^ can not be true and the analogue 



n 



of the Theorem 4-2 cannot be consider for LAD estimator. 



We can minimise ln{f3^) and we obtain another estimator /3„ of /3°, called the 
maximum empirical likelihood estim ator (MELE). Usi n g the same arguments 



as used in the proof of Theorem 1 in lQin and Lawless! (1l994l ). we obtain: 



Theorem 4.3 Under assumptions (HI), (H2), (H4), (HA) and: 

• ]E[gi^Ls{/3^)giLsi(^^)] ^5 positive definite, 

• dgi^Ls{(^)/d(3 is continuous in a neighborhood of the true value /3^, \\dgi^]:^s{/3)/d/3\\ 
and \\gi,Ls{f3)\\^ are bounded by some integrable function in this neighbor- 
hood, 

• the rank of lE[dgi^Ls{l3°)/df3] is d. 



• d'^gi^Ls{f^)/dP(3* is continuous in a neighborhood of the true value f3^ , 
||9^gj i5(/3)/9/3/3*|| is bounded by some integrable function in this neigh- 
borhood 

then 

(i) convergence rate of 0,^ is n^^^'^: ||/3„ — /3°|| = 0]p{n^^^^). 

(ii) ^iX-/^'') ^ ^d{0,VMELE), with Vmele = A-^BmeleA-\ 

Bmele = iE[7r(X)£2(X)f (X; /3°)f*(X; /3°)]-iE[7r(X)f (X; /30)f*(X; /3°)f (X; /3°)f*(X; (3^)]. 



5 Simulation study 



In this section we use Monte Carlo simulation to compare empirical likelihood 
method with normal method. For nominal confidence level 1 — a = 0.95, using 
the simulated samples, we evaluated the coverage probabilities (CP) of the 
confidence regions (CR) given by: 

- approximated empirical log-likelihood method on the completely observed 
data (TheoremEU): CRls = {/3; /;(/3) < ci_„;4, CRlad = {/3; /:(/3) < ci_„;4 
by LS or LAD method, respectively, where ci-a-^d is the (1 — a) quantile of the 
standard chi-square distribution with rf-degrees of freedom and ^j^(/3) given by 
(ffOD; 

- approximated empirical log-likelihood /i^(/3), given by flT3|) . associated at 

gn,i(/3) on the reconstituted data: CRls = {/3;^n(/3) ^ ci-a-A] 

- normal method, based on Theorem 13.21 

NCRls = {/3; ri(^„,i5 - /3°)*AB-i A(^„,^5 - ^°) < c^^^,,} and 

NCRlad = {(3; 4:ne{O)0^^LAD " ^°)*A0„,^^^ - /3°) < Ci_«;,}. 



Throughout this section, the kernel function is taken as the Epanechnikov 
kernel K{x) = 0.75(1 — x^)l|a;|<i, the bandwidth sequence hn = n~^l'^ which 
satisfies assumption (H5). We generate M = 2000 Monte Carlo random sam- 
ples of size n for X ~ A/'(l, 1) and for errors either Af{0,a^) either Laplace 
£(0, 0"^) or Cauchy C(0, cr^), with a = 1, 2. We consider following two cases of 
response probability under the MAR assumption: 

a) 7i(x) = . 8 + 0.2 |x — 1| if |a; — 1| < 1 and 0.95 elsewhere (similar the linear 
case of IXud ( 2009al )). The average missing rate is 0.91 and empirical mean of 



fc{X,i) is 0.905 for X ~ ^f{l, 1). 
b) 7i{x) = 0.8 for all x. 

Case a) unlike to case b) excludes most of the error outliers. 
The coverage probability are estimated by the frequency of the true values /3° 
falling into the confidence intervals in M=2000 simulations. 
We denote by CPls, CPlad, CPls, NCPls, NCPlad the coverage probabil- 
ities corresponding to the confidence regions CRls, CRlad, CRls, NCRls, 



10 



NCRlad respectively. 

All simulations, calculations of estimations and statistical computations were 
performed using R language. To calculate the LS estimations on nonlinear 
model we used nls function of package stats and on linear model Im function 
of package base. To calculate the LAD estimations function nlrq of package 
quantreg was used if / is nonlinear and rq function of the same package for lin- 
ear model. We used also VG AM package for random generation of the Laplace 
distribution by rlaplace function. To generate Normal and Cauchy distribution 
the functions rnorm respectively rcauchy of stats package are used. 



5. 1 Nonlinear model 



Let us consider following nonlinear function corresponding a two-compartment 
model: 

f{x- /3) = ^l^(e-^^-' - e-^n, (3 = (A, /J^), 

with the true values: P^ = 1 and /J^ = 1-5. 

In Figures [U El [3] a simulation of this model is plotted for n = 300, 7r(x) = 0.8, 
a = 1 for errors e ~ A/", C, C respectively. We represented with "solid circle" 
the reconstituted values Yn^i and with "triangle" the true complete values Fj. 
In Table [1] we get the coverage probability CPls, CPlad, CPls, NCPls, 
NCPlad in the case 7r(X) = 0.8 + 0.2|a; - 1| if |a; - 1| < 1 and 0.95 elsewhere 
when n = 300, n = 100. In Table [2] we get the same five coverage probabilities 
for the case vr(X) = 0.8. These two tables show that: 

(1) CPls is better than CPlad excepting for the case e ~ C(0, 2); 

(2) coverage probabilities on the reconstituted responses CPls is bigger than 
0.95 and has values very closed to values of CPls obtained on complete 
data; 

(3) NCPls, NCPlad are worse than coverage rate on the complete or re- 
constituted data, particularly for n = 100 or Cauchy errors. For Cauchy 
distribution we cannot calculate NCPls since the assumptions of Theo- 
rem [32](i), more precisely (HI), are not satisfied ; 

(4) Since case a) has tendency to eliminate the outliers of e, the rates of 
normal coverage probability are bigger for a) than for b). 

Tables [3] and H] contain empirical mean, standard- deviation of parameter esti- 
mations $n;LS and (3n]LAD fo^ ^^e two cascs of response probability. We deduce 
from these two tables: 

(1) there is no difference between the parameter estimations obtained for 
both cases of probability a) and b), for the same n and the same distri- 
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bution of e; 
(2) already known thing: for distributions with outhers (Laplace or Cauchy 
here) the LAD estimators have a standard-deviation smaller than LS 
estimators. 

Table O whether it is for the probability a) or b), the medians of LAD estima- 
tions are more close to the true values than the medians of LS estimations, if 
e have outliers. The precision of the estimators increases with n and in a less 
measure with the probability n^x). 



5.2 Linear model 



Consider now the simple linear model: Yi = Xj/3 -|- Si, with the true value of 
the parameter /9° = 10. In Tables M and [7| we get the coverage probabilities 
for the two case of vr(X) and for n = 100, 20. From these two tables we make 
the following remarks: 

(1) in general CPlad performs better than CPls when e ~ £ or C; 

(2) CPls is bigger than 0.95 and has very close values to CPls values; 

(3) similar of (3) for nonlinear case, except that NCPlad < NCPls] 

(4) with regard to both cases of probability tt{X): there is no difference be- 
tween the three coverage probabilities by estimated EL methods, contrary 
to the nonlinear case nor between the natural coverage probabilities. 

Tables Eland [HI the standard-deviation of LS estimators is much bigger than for 
LAD's when e ~ C. That comes from the outliers presence in nonlinear case, 
the algorithm to find the LS estimator can not converge, thus the estimations 
are not available. 



5. 3 Conclusion 



The coverage probabilities of CR given by normal method are lower than the 
nominal level 1 — a especially for small sample sizes. The simulation results 
show that in terms of coverage probability the empirical likelihood methods 
outperform the normal approximation method in particular when error distri- 
bution have outliers or its has a bigger standard-deviation. 
It is interesting to note that, if the distribution of the errors presents outliers 
the approximated EL for LAD method gives generally best results than that 
for LS method that concerns EL coverage probability or parameter estimators 
efficiency. Then, if e ~ Af, in order to test hypothesis Hq : f3 = f3°, it is 
recommended to take the test statistic ln{(3) given by (TTOl) for complete data 
or /^(/3), given by (IT^ for improved data using LS method. Let us emphasize 
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that ini/3) gives very close results to those of Inif^)- On the other hand, if e 
presents outliers, to test Hq : (3 = (3° it is recommended /J^(/3) given by (TTU1) 

at gi = gi.LAD- 



6 An application 



In R language consider Chwirutl data of NISTnls package to model the ul- 
trasonic response value Y function to metal distance value X using nonlinear 
function: 

The realizations of {Y,X) are known for n = 214 observations. In Figure H] 
we represented variable y=ultrasonic response function of regressor X=metal 
distance: "solid circle" for reconstituted values Yn^i, "triangle" for true com- 
plete values Yi. The LS parameter estimations on all 214 observations are: 
61 = 0.19, 62 = 0.006, 63 = 0.10. By 10000 Monte Carlo samples, we elimi- 
nate 20%, 50%, 80% values of Y. Consider that the true value /3° of /3 is the 
one obtained on 214 observations. In Table [10] we have the acceptance rate 
of hypothesis Hq : (3 = (3^ with respect to missing probability 1 — vr(X) and 
with respect to the estimation method (LS or LAD) on complete data using 
statistic /j^(/3) given by ( TTOl) . on reconstituted data using statistic /j^(/3) given 
by ( IT3I) . We observe that, even by eliminating 80% observations, the obtained 
estimations are very close to /3°. In Table [H] we find the empirical means, 
standard-deviations of differences Yi — Fj q; ^i ~ Yi, where: Y^ are the true val- 
ues of Y, Fj is the forecast for Yi using all 214 observations by LS method, 
^i,o = fi^i^/^Ls)- For i = 1,- ■ ■ ,n, Yi = Y^i is the reconstituted value of Yi 
by relation flTTj) . We deduce that the variable response is better reconstituted 
by fITT]) than by LS method on all observations. 



7 Appendix: proofs 



We give proofs for the results in Sections 3 and 4. 

In the following, we denote by C a generic positive finite constant not de- 
pending on n which may take different values in different formulae or even in 
different parts of the same formula. 



Proof of Lemma 13.11 (i) We apply the central limit theorem. 
(a) By the weak law of large numbers. 
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(in) We combine Lemma 3 of lOweru (119901) and results for nonlinear regression 
without missing data (see e.g. ISeber and Wildl (120031 ) ). ■ 



Proof of Theorem 13. II We take the Taylor's expansion of ln{f3^) given by (|8]): 
/n(/3°) = 2A*Er=i&(/3")-All2.iK.(/3°)gK/3°)A+A*0^(Er=ig.(/3°)(A*g,,(/3°))2) 
By a similar approach of lOwenI ( Il990i ). using also Lemma 13. ![ we obtain: 
||A|| = 0]p{n~^^'^). Using Lemma [3.1( iii) we obtain: 



n 



-i\t 



A*E&(/3°)(A*g.(/3°))^ = ojp n-i5:(A*g.(/3°))^ 



i=l 



j=l 



(14) 



Then, returning at ln{f3^), we have: 



in{(3')=2Y: 



i=l 



A*g,,(/3°) - iA*g.(/3°)g*(/3°)A 



;i+Mi))- 



(15) 



Relation ([7]) can be written also: 



= -5:g.(^°)-i5:g.(/3°)g*(/3°)A+i5:g.(^°)(A*g.(^°))^(l+A*g.(/3°))-^ 



n 



i=l 



n 



i=l 



n 



1=1 



On the other hand, using (IT^ . we obtain: 







-in 1 " 

^E&(/3°)-^E&(/3°)g'(/3°)A 



n 



i=l 



n 



i=l 



:i+oip(i))- 



(16) 



Thus 



^Eg^(/3°)g!(/3°) 



ra 



i=l 



1=1 



;i+oip(i))- 



(17) 



Relation ^ implies also: A* ELi gi(/3°) = [A*Er=i &(/3°)g*(/3°)A 
Ojp(l)). Then, expression ( 1T5l) of /„(/3°) becomes, using relation ( TT71) : 



:i + 



^n(/3°) 



A*E&(/3°) 



i=l 



/ 1 n 

;i+oHi)) = b;M— Eg*(/3°) 



^T=i 



1 n \ 



n T- 



By Lemma [3.1( 1) and (ii) we have: lnif3^) — > X^id)- 



Proof of Theorem 13. 2[ (i) By definition, the LS estimator is obtained as the 
solution of the system: = Y17=iSi{0n is)- Using Lemma [3nT i) we have: 

^-'/'Er=ig.(/3°) A -^,(0,B), then n-V2^^^Jg.(^ ) _ g^(/3°)] -A 
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A/'(0,B). This last relation implies: 



n 



-'J:[S^0n,Ls)-S^{f3')]=OJP^ 



n 



'1/21 



i=l 



On the other hand, using Taylor expansion: 

&(^n,L5)-&(/3°)=5^f(X.;^„,i5)[>^.-/(X.;^„,i5)]-5if(Xi;/3°)[>^.-/(X,:;/3°)] 

= fe [f (X,; ^„,^5) - f (X,; f3^)]+5, [f (X,; ^„,^5) - f (X,; (3^)] [/(X„ /3°) - /(X,; ^^^^^^ 

+5,f(X,;/3°) [/(X„/3°) -/(X,;^„^^5)] 
Since ^^^^^ ^-^ /3°, we obtain: gi(^„,i5) - &(/3°) = ^i^if(Xi; /3i,„)(^„,l5 - 

with /3i,„ = /3° + ni0„,i5 - /3°), /32,n = /3° + n2(;9„,L5 " /3°), «i, ^2 G [0, 1]. 
Under assumptions (HI) and (H4): 



n-' Er=i 



.(/3 



i\f-'n,LS) 



.m 



Taking into account relation (TT8| 



, we have: 



Combining these last two results, we obtain: 

n 

and claim follows. 



i=l 



C 



(19) 



^fd{o,B) 



ii) We apply IWeissI fll99l[ ). 



Proof of Lemma 14.11 (i) Let us consider following decomposition: 

:k T:=l{yn, - ^°) =S, + S2 + Ss, with: 



C 1 Y^n 



\/n 



7r(X,) 



+ /(X,;/3°)-eO 



•^2 - ;^ EILi ^*^i 



y/n 



1 



1 



7r(Xi) 7r(Xi) 



'S's — -^ ElLi [/(Xj; /3„^i5) - /(Xj; /3 ) 



1 - ^ 



*(X0 



For ^i : \/ar[^+/(X,; /3°)-0O] = Var[^]+Var[/(X,; /3°)]+2Cot;(/(X,; /J"), ^). 
By (HI) we have Cot;(/(Xi; /3°), ^) = 0. On the other hand: Var ' 



E 



7r2(X,) 



E 



7r2(Xi) 



E 



Si£_ 

(X.) 
£!(X) 



SiSi 



7r(X,) 



Since (Xj,£:j) are independent for differ- 
ent i, then the random variables ^/^\ — /(X^; /3°) — 6'° are also independent. 
Then, we can apply the central limit theorem: 5*1 — > A/'(0, W). 

For 5*3 . We make a limited development for 5*3 until order 2 around /3° and 
taking into account relation (fT9l) . hypothesis vr(x) > and (H4): 

^3 = 0n,LS-Py^J:ti(^-^) f(X,;/3°)(l+o^(l)) = OAn-'/'W{n'/') 



02p(l). By Lemma 3 of IXud (l2009al ) we have S2 = ojp(l), then the claim (i) is 
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proved. The proof of (ii) and (iii) is similar. 



Proof of Lemma 14.21 (i) Function gn,i{f3) can be written: 

gnAl3) = ^^ + (l - ^] [/(X.; P^,,s) - /(X.; /3)]f (X.; /3). (20) 



For the first term of the right-hand side of (1201) we have: 

^-1/2 J2 ^\^^l^p J^ M, fo, E Ux)-h\y.)iOL- /3°)f*(X; (3^) 



■ (21) 



On the other hand, using a hmited development, relation flT^ and assumption 
(H4): n-V2 ^-^^ (i _ ^) [/(x,; ^„^^^) - /(X,; /3°)]f (X,,; /3°) 

= ^^^^Er=i (l - ^) f*(X,;/3,,Jf(X,;/3°), with /3,,„ between /3° and 
/3„ ^5. Let us consider following random variable: 

S^{p) = n-^ ElLi (1 - ^) f*(Xi; /3)f (X,; /3°), for /3 in a n-^/^-neighborhood 
of /3°. If we prove: 

S'„(/3)=ojp(l), (22) 

with ojp(l) uniformly in (3, tacking into accou nt also (fT9ll, we obtain (i). 



Let us remind at first some results of the paper IXud (l2009al ). on the estimators 



Ti^X.i) of vr(Xj). Under assumptions (H3), (H6) and (H7) we have uniformly- 
over 1 < i < n: 

IE[7r{X,) - 7r(X,)]2 = Oiinhf^yM^) + ©(/i^-^^^.d-i)) ^ o{n-'/^). (23) 

In the same paper, following two random variables are considered: Wnji'x.) = 
ii'ft(Xfc-x)/max{l,Er=i-^/i(Xj-x)}, for x G T, j = 1, • • • ,nandC„(Xj) = 
max{l, Ci Y^jjki l||Xj-Xi|i<ph„} with Ci, p positive constants defined in assump- 
tion (H6). For this last random variable we have E [C~^(Xj)] = oip{n~^''^). 
We now turn to study 5'„(/3), which can be written, after a limited develop- 
ment: 

Sn{/3) = - E f 1 - -tIt) llf (X.; /3°)f +^^ E f 1 - ^) f (X.; ^)f (X, /3°) 

n^V ^(Xi)/ n f^^\ 7r(Xi)y 

(24) 

with /3 between /3° and f3. We denote decomposition (!24l) : Sn{f3) = Si + 
82- Using Cauchy-Schwarz inequality, relation ( 1T9|) and assumption (114), 
it is easily shown that 5*2 = ojp(l) uniformly in f3. We study now Si = 

IT^U (1 - ^) l|f(X.;/3°)f +iELi5. (;;(fc) - ^) l|f(X.;/3°)f = 5, 



'ir 
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Su- Second term ^12 can be written: ^12 = n'^ ELi <5 /^^;jg;f '^ ||f(Xj.; /3°)f - 

^"' Sr=i ^^SSSf l|f ^^- '^°)ll'- '^^^ ^^'^ ^^™ °^ "^12 is ojp(l) by m- For 
the first term of 012 we liave tlie decomposition: 

n-' Er=i 5,^i^ygP||f (X,; /3°)f ^ Ti + T2 + T3, witfi 
T2 



T3 = n-i Er=i ^ 



'- lf(X,;/3°)f 



-'jj' 



By Caucfiy-Scfiwarz inequality, assumptions (H4), 



Kx.) I 



H7) we liave: 



mT?] < S ELi ^ [^ [E •=! 1^4.(X,)[vr(X,) - vr(X,)]^|X, 

Tlien Ti = ojp(l). 

ForT2: E[T2] < ^ Er=i ^ fE-=i W^'(X.)1 < ^ Er=i ^ 



< Chin ^ — > for 77, — )■ cx). 



-)■ for 



n — )■ 00, then T2 = ojp(l). In a similar way: 



iK[T3]<^Er=i[iK(^l|f(X.;/3°) 



IX, 



E-=iW^„,(x, 



-^ 0. Thus 



Ti + T2 + T3 = Oip(l), what imphes 5*12 = ojp(l). 

Finally, for Sn: -E'[S'ii] = and with the same arguments as for T2: ]E[Sfi] — > 

for n ^- n, then Su = ojp(l). With all this, relation (l22l) is proved. 

the proof of (ii) and (iii) is similar. ■ 
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Normal errors 



Laplace errors 





Fig. 1. Response variable Y function regres-Fig. 2. Response variable Y function regres- 
sor X : "solide circle" for reconstituted val- sor X: "solide circle" for reconstituted val- 
ues Yn^i, "triangle" for true complete values ues Yn^i, "triangle" for true complete values 
Yi. Normal error e ~ A/'(0, 1), n=300. Yi. Laplace error e ~ £(0, 1), n=300. 



Cauchy errors 
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Fig. 3. Response variable Y function regres-Fig. 4. Variable Y =ultrasonic response func- 
sor X: "solide circle" for reconstituted val- tion regressor X=metal distance: "solide 
ues Yn^i, "triangle" for true complete values circle" for reconstituted values Y^^i, "trian- 
Yi. Cauchy error e ~ C(0, 1), n=300. gle" for true complete values Yi. 
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Table 1 



Coverag 


e probabilities 


for nonlinear model, 7r(X) 


= 0.8 + 0.2|x- ] 


- ) if \x 


- 1| < 1 




and 0.95 elsewhere, n - 


= 300, 100 


, error 


Gaussian, Laplace, 


Cauchy. 










n = 


300 








n = 


100 






e 


CPls CPlad 


CPls 


NCPls 


NCPlad 


CPls 


CPlad 


CPls 


NCPls 


NCPlad 


M(0, 1) 


0.977 0.825 


0.959 


0.981 


0.884 


0.975 


0.747 


0.956 


0.63 


0.507 


C{0, 1) 


0.971 0.881 


0.967 


0.991 


0.829 


0.978 


0.836 


0.95 


0.604 


0.453 


C(0, 1) 


0.983 0.991 


0.984 




0.293 


0.987 


0.963 


0.987 




0.586 


A^(0, 2) 


0.964 0.964 


0.945 


0.774 


0.423 


0.976 


0.957 


0.917 


0.86 


0.686 


£(0,2) 


0.977 0.979 


0.974 


0.455 


0.394 


0.978 


0.959 


0.978 


0.925 


0.439 


C(0,2) 


0.986 0.999 


0.988 




0.615 


0.987 


0.994 


0.987 




0.541 



Table 2 

Coverage probabilities for nonlinear model, tt{X) 

sian, Laplace, Cauchy. 



0.8, n = 300, 100, error Gaus- 







n = 


300 








n = 


100 






e 


CPls 


CPlad 


CPls 


NCPls 


NCPlad 


CPls 


CPlad 


CPls 


NCPls 


NCPlad 


Af{0, 1) 


0.970 


0.807 


0.960 


0.955 


0.923 


0.972 


0.780 


0.931 


0.508 


0.393 


C{0, 1) 


0.972 


0.872 


0.975 


0.986 


0.891 


0.976 


0.823 


0.975 


0.227 


0.392 


C(0, 1) 


0.985 


0.985 


0.987 




0.753 


0.987 


0.958 


0.987 




0.772 


Af{0, 2) 


0.979 


0.968 


0.949 


0.560 


0.298 


0.979 


0.958 


0.886 


0.159 


0.254 


£(0,2) 


0.979 


0.977 


0.969 


0.668 


0.288 


0.979 


0.965 


0.973 


0.353 


0.234 


C(0,2) 


0.986 


0.999 


0.985 




0.636 


0.991 


0.992 


0.990 




0.699 



Table 3 

Mean, standard deviation of parameter estimations for nonlinear model, tt{X) = 
0.8 + 0.2|x — 1| if |x — 1| < 1 and 0.95 elsewhere, error Gaussian, Laplace, Cauchy. 
True values /?? = 1, /?o = 1.5. 
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n 


mean 


mean 


sd 


sd 


mean 


mean 


sd 


sd 


error 


LS 


LAD 


LS 


LAD 


LS 


LAD 


LS 


LAD 


n = 300 










Ar(o,i) 


1 


1 


0.13 


0.16 


1.50 


1.49 


0.26 


0.31 


C{0,1) 


1 


1 


0.17 


0.14 


1.48 


1.50 


0.36 


0.28 


C(0,1) 


0.99 


0.99 


0.68 


0.15 


1.40 


1.50 


1.58 


0.33 


Ar(0,2) 


1 


0.99 


0.25 


0.28 


1.49 


1.51 


0.53 


0.62 


C{0,2) 


0.99 


0.98 


0.31 


0.26 


1.51 


1.52 


0.74 


0.61 


C(0,2) 


1.01 


0.95 


0.82 


0.36 


1.43 


1.61 


2.06 


1.02 


n = 100 










Ar(o,i) 


0.97 


0.96 


0.25 


0.29 


1.58 


1.59 


0.70 


0.77 


C{0,1) 


0.96 


0.95 


0.33 


0.29 


1.60 


1.61 


0.97 


0.86 


C(0,1) 


1.01 


0.94 


0.79 


0.83 


1.54 


1.66 


2.32 


2.09 


Ar(0,2) 


0.96 


0.94 


0.42 


0.54 


1.66 


1.70 


1.40 


1.71 


£(0,2) 


0.93 


0.91 


0.48 


0.65 


1.74 


1.81 


1.65 


1.93 


C(0,2) 


1.02 


0.87 


0.93 


0.90 


1.53 


1.85 


2.53 


2.21 
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Table 4 

Mean, standard deviation of parameter estimations for nonlinear model, it{X) = 0.8, 

error Gaussian, Laplace, Cauchy. True values /Jj* = 1, /32 = 1.5. 







/3i 








/32 






n 


mean 


mean 


sd 


sd 


mean 


mean 


sd 


sd 


error 


LS 


LAD 


LS 


LAD 


LS 


LAD 


LS 


LAD 


n = 300 










Af(0,l) 


1 


0.99 


0.14 


0.18 


1.50 


1.50 


0.29 


0.35 


£(0,1) 


1 


0.99 


0.19 


0.15 


1.50 


1.50 


0.40 


0.31 


C(0,1) 


0.99 


0.96 


0.70 


0.25 


1.51 


1.58 


1.87 


1.69 


Af(0,2) 


1 


0.98 


0.26 


0.31 


1.49 


1.52 


0.58 


0.70 


C{0,2) 


0.97 


0.96 


0.34 


0.29 


1.54 


1.57 


0.82 


0.72 


C(0,2) 


0.96 


0.96 


0.83 


0.37 


1.57 


1.55 


2.13 


0.95 


n = 100 










A/'CO.l) 


0.96 


0.95 


0.28 


0.35 


1.62 


1.66 


0.99 


1.21 


C{0,1) 


0.96 


0.95 


0.32 


0.31 


1.62 


1.64 


1.23 


0.94 


C(0,1) 


0.97 


0.95 


0.81 


0.99 


1.59 


1.73 


2.,32 


1.98 


Ar(0,2) 


0.94 


0.94 


0.44 


0.52 


1.70 


1.77 


1.51 


1.85 


/:(o,2) 


0.95 


0.93 


0.51 


0.51 


1.66 


1.75 


1.66 


1.65 


C(0,2) 


1.09 


0.89 


0.94 


0.58 


1.25 


1.83 


2.28 


2.04 



Table 5 

Median of parameter estimations for nonlinear model, two cases for tt{X), error 

Gaussian, Laplace or Cauchy. True values fS^ = 1, /32 = 1.5. 







/3i 








/32 






n 


case 


a) 


case 


b) 


case 


a) 


case 


b) 


error 


LS 


LAD 


LS 


LAD 


LS 


LAD 


LS 


LAD 


n = 300 










A/'(0,2) 


1 


1 


1 


1 


1.51 


1.5 


1.49 


1.5 


C{0,2) 


0.99 


1 


0.98 


0.99 


1.52 


1.5 


1.53 


1.5 


C(0,2) 


1.17 


1 


0.96 


1 


1.56 


1.5 


1.48 


1.5 


n = 100 










A/-(0,2) 


0.97 


0.96 


0.99 


0.94 


1.45 


1.5 


1.48 


1.51 


C{0,2) 


0.99 


0.99 


1 


0.99 


1.39 


1. 49 


1.39 


1.50 


C(0,2) 


1.18 


1 


1.29 


1 


1.81 


1.5 


1.62 


1.5 
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Table 6 

Coverage probabilities for linear model, tt{X) = 0.8 + 0.2|x — 1| 
0.95 elsewhere, n = 100, 20, error Gaussian, Laplace, Cauchy. 



if Ix — ll < 1 and 







n = 


100 








n = 


20 






e 


CPls 


CPlad 


CPls 


NCPls 


NCPlad 


CPls 


CPlad 


CPls 


NCPls 


NCPlad 


Ar(0, 1) 


0.95 


0.943 


0.954 


0.945 


0.802 


0.967 


0.921 


0.955 


0.994 


0.785 


C{0, 1) 


0.960 


0.991 


0.960 


0.962 


0.784 


0.965 


0.959 


0.965 


0.864 


0.757 


C(0, 1) 


0.985 


1 


0.986 




0.745 


0.984 


0.999 


0.979 




0.737 


JV{0,2) 


0.956 


1 


0.956 


0.983 


0.631 


0.948 


0.999 


0.948 


0.83 


0.612 


£(0,2) 


0.952 


1 


0.952 


0.889 


0.665 


0.967 


0.999 


0.965 


0.993 


0.588 


C(0,2) 


0.974 


1 


0.974 




0.537 


0.983 


1 


0.981 




0.526 



Table 7 

Coverage probabilities for linear model, it{X) 

Laplace, Cauchy. 



0.8, n = 500, 20, error Gaussian, 







n = 


100 








n = 


20 






e 


CPls 


CPlad 


CPls 


NCPls 


NCPlad 


CPls 


CPlad 


CPls 


NCPls 


NCPlad 


Af{0, 1) 


0.954 


0.948 


0.951 


0.964 


0.800 


0.961 


0.913 


0.943 


0.947 


0.848 


C{0, 1) 


0.950 


0.985 


0.944 


0.962 


0.811 


0.965 


0.958 


0.952 


0.956 


0.806 


C(0, 1) 


0.983 


1 


0.978 




0.769 


0.992 


0.999 


0.978 




0.656 


A/'(0,2) 


0.952 


0.999 


0.950 


0.949 


0.561 


0.968 


0.997 


0.946 


0.870 


0.630 


£(0,2) 


0.955 


1 


0.952 


0.958 


0.583 


0.964 


0.998 


0.946 


0.993 


0.620 


C(0,2) 


0.982 


1 


0.982 




0.54 


0.985 


1 


0.976 




0.481 



Table 8 

Mean, standard deviation, median of parameter estimations for linear model, 
'7t{X) = 0.8 + 0.2|x — 1|, if jx — 1| < 1 and 0.95 elsewhere, f3^ = 10, error Gaussian, 
Laplace, Cauchy. 



n 


mean 


mean 


sd 


sd 


median 


median 


error 


LS 


LAD 


LS 


LAD 


LS 


LAD 


n = 100 








Ar(0,l) 


10 


10 


0.07 


0.09 


10 


10 


C{0,1) 


10 


10 


0.10 


0.08 


10 


10 


C(0,1) 


9.94 


10 


93 


0.12 


9.98 


10 


Ar(0,2) 


10 


10 


0.15 


0.18 


10 


10 


£(0,2) 


10 


10 


0.21 


0.16 


10 


10 


C(0,2) 


10.7 


10 


59 


0.25 


9.92 


10 


n = 20 








A/'(0,1) 


10 


10 


0.17 


0.22 


10 


10 


£(0,1) 


10 


10 


0.24 


0.22 


10 


10 


C(0,1) 


10 


10 


189 


0.34 


10 


10 


Ar(0,2) 


10 


10 


0.34 


0.42 


10 


10 


£(0,2) 


9.97 


9.98 


0.50 


0.44 


9.98 


9.98 


C(0,2) 


8.97 


10 


29 


0.65 


10 


10 
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Table 9 

Mean, standard deviation, median of parameter estimations for linear model, 

7r{X) = 0.8, f3^ = 10, error Gaussian, Laplace, Cauchy. True value j3^ = 10. 



n 


mean 


mean 


sd 


sd 


median 


median 


error 


LS 


LAD 


LS 


LAD 


LS 


LAD 


n = 100 








AT (0,1) 


10 


10 


0.08 


0.10 


10 


10 


£(0,1) 


10 


10 


0.11 


0.09 


10 


10 


C(0,1) 


9.71 


10 


16 


0.13 


10 


10 


Ar(0,2) 


10 


10 


0.17 


0.21 


10 


9.99 


C{0,2) 


9.99 


10 


0.22 


0.18 


9.99 


10 


C(0,2) 


13 


10 


1.84 


0.26 


9.97 


9.99 


n = 20 








Ar(0,l) 


10 


10 


0.20 


0.23 


10 


10 


C{0,1) 


10 


10 


0.26 


0.25 


10 


10 


C(0,1) 


8.46 


10 


31.2 


0.38 


9.99 


9.99 


Ar(0,2) 


9.99 


9.98 


0.37 


0.47 


9.98 


9.98 


-C(0,2) 


10 


10 


0.53 


0.50 


10 


10 


C(0,2) 


9.22 


10 


34 


0.82 


10 


10 



Table 10 

The acceptance rate of hypothesis Hq : (3 = (3^ with respect to missing probability 

1-<X). 





1 - 7r(X) = 0.20 


1 - 7r(X) = 0.50 


1 - 7r(X) = 0.80 


complete data, LS method 


1 


0.996 


0.976 


complete data, LAD method 


0.993 


0.977 


0.983 


reconstituted data, LS method 


1 


0.992 


0.874 



Table 11 

Empirical means, standard-deviations of differences between Yi and its reconstituted 

(forcasted) value, with respect to estimation method and missing probability 1 — 

7r(X). 





1 - 7r(X) = 0.20 


1 - 7r(X) = 0.50 


1 - 7r(X) = 0.80 


mean(yi - Yifi^Ls) 


0.066 


0.066 


0.066 


can(Y, - Yifi^LAo) 


0.321 


0.321 


0.321 


mean(yi - Yi^Ls) 


0.002 


-0.003 


-0.0158 


sd(y, - Y,,o,Ls) 


3.35 


3.35 


3.35 


sd{Yi - Yifi^LAo) 


3.36 


3.36 


3.36 


sd(y, - Y,,LS) 


1.71 


3.44 


6.65 
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